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ARTICLE INFO ABSTRACT 

Article history: The spike (S) protein of human coronavirus NL63 (HCoV-NL63) mediates both cell attachment by bind- 
Received 24 March 2011 ing to its receptor hACE2 and membrane fusion during virus entry. We have previously identified the 
Received in revised form 24 June 2011 receptor-binding domain (RBD) and residues important for RBD-hACE2 association. Here, we further 
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characterized the S protein by investigating the roles of the cytoplasmic tail and 19 residues located in 
the RBD in protein accumulation, receptor binding, and pseudotype virus entry. For these purposes, we 
first identified an entry-efficient S gene template from a pool of gene variants and used it as a back- 
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ieee bone to generate a Series of cytoplasmic tail deletion and single residue substitution mutants. Our results 
spike protein showed that: (i) deletion of 18 aa from the C-terminus enhanced the S protein accumulation and virus 
Receptor binding entry, which might be due to the deletion of intracellular retention signals; (ii) further deletion to residue 


29 also enhanced the amount of S protein on the cell surface and in virion, but reduced virus entry by 
25%, suggesting that residues 19-29 contributes to membrane fusion; (iii) a 29 aa-deletion mutant had 
a defect in anchoring on the plasma membrane, which led to a dramatic decrease of S protein in virion 
and virus entry; (iv) a total of 15 residues (Y498, V499, V531, G534, G537, D538, S540, G575, S576, E582, 
W585, Y590, T591, V593 and G594) within RBD were important for receptor binding and virus entry. 
They probably form three receptor binding motifs, and the third motif is conserved between NL63 and 
SARS-CoV. 


Pseudotype virus entry 


© 2011 Elsevier B.V. All rights reserved. 


1. Introduction 


Human coronavirus NL63 (HCoV-NL63, NL63) was first 
described in 2004 in the Netherlands (van der Hoek et al., 2004). 
However, it was already isolated in 1988 (Fouchier et al., 2004), 
or even earlier (van der Hoek et al., 2006). NL63 has since been 
reported in more than 15 countries across the world and accounts 
for up to 9.3% of acute respiratory infections leading to hospital- 
ization (Albuquerque et al., 2009; Arden et al., 2005; Bastien et al., 
2005; Chiu et al., 2005; Dare et al., 2007; Ebihara et al., 2005; Esper 
et al., 2005; Gerna et al., 2006; Han et al., 2007; Koetz et al., 2006; 
Moes et al., 2005; Oosterhof et al., 2009; Smuts et al., 2008; Vabret 
et al., 2005). These findings suggest that NL63 has been widely 
circulating in the human population for decades. NL63 infection 
generally causes mild upper respiratory tract diseases, but may 
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also cause more severe lower respiratory tract diseases, e.g. croup, 
bronchiolitis, and pneumonia in young children, the elderly, and 
immunocompromised people (van der Hoek et al., 2006). No vac- 
cine or antiviral drug is currently available for NL63. 

Coronaviruses can be divided into three groups based on anti- 
genic and phylogenetic relationships. NL63 belongs to group I (van 
der Hoek et al., 2004). Like other coronaviruses, the 5’ proximal 
two-third of NL63 genome is occupied by two large replicase genes 
1a and 1b, while the 3’ proximal one-third encodes four struc- 
tural protein genes: spike (S), envelope (E), membrane (M), and 
nucleocapsid (N). These genes are arranged in a conserved order 
5’-1a-1b-S-E-M-N-3’ (Fig. 1). 

The S protein of coronaviruses forms a layer of long petal-shaped 
surface spikes that give the virions a crown-like appearance when 
visualized by electron microscopy (Lai and Holmes, 2001). S pro- 
tein is anchored on the viral envelope, a lipid bilayer derived from 
the intracellular membrane during virus budding. S protein is a 
type I transmembrane protein containing a signal peptide, long N- 
terminal ectodomain, transmembrane (TM) domain, and a short 
C-terminal cytoplasmic tail (Fig. 1). The signal peptide directs the 
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Fig. 1. Schematic diagram of genome organization of NL63 and domain architecture of spike (S) protein. Signal: signal peptide; RBD: receptor-binding domain; FP: fusion 
peptide; HR: heptad repeat; TM: transmembrane domain; Cyto: cytoplasmic tail. Twenty-one residues that have been previously shown to be important for RBD-hACE2 
interaction are indicated within three receptor binding motifs (RBMs) (Li et al., 2007; Lin et al., 2008). 


nascent S polypeptide to the endoplasmic reticulum (ER), where 
the signal peptide is cleaved and monomeric precursors are synthe- 
sized. The precursors are heavily glycosylated to yield 150-200 kDa 
mature monomers, which are further oligomerized into trimers 
within the ER (Masters, 2006). S trimers interact with M protein 
through their cytoplasmic tails and are transported to the ER-Golgi 
intermediate compartment (ERGIC) where assembly and budding 
occur (Hogue and Machamer, 2008). This is in contrast to the assem- 
bly and budding of many enveloped viruses, e.g. HIV, that occur on 
the plasma membrane (Chen and Lamb, 2008). 

In groups 2a, group 3 and some group 1 coronaviruses, the S 
proteins are cleaved in the Golgi complex by a furin-like protease 
into noncovalently associated S1 and S2 subunits prior to assembly 
into the virion (de Haan et al., 2008; de Haan and Rottier, 2005). The 
S protein of NL63 does not have furin-recognition site and is thus 
not cleaved during biogenesis. Nevertheless, the $1 (21-717 aa) and 
S2 (718-1356 aa) domains can still be deduced based on sequence 
similarity. 

The S1 domain of coronavirus mediates cell attachment through 
interaction with a cellular receptor. As such, the particular 
S1-receptor interaction largely determines the host range, tissue 
and cell tropisms of coronaviruses. In addition, S1 is the major 
determinant for eliciting neutralizing antibody, which in turn 
blocks virion-receptor binding and protects animals from infection 
(Gallagher and Buchmeier, 2001; Wentworth and Holmes, 2007). 
Many coronaviruses have variations in $1 that evade immune 
response. Amazingly, variations as small as one amino acid have 
been shown to change receptor usage, tissue tropism and patho- 
genesis (Ballesteros et al., 1997; Krempl et al., 2000). The S2 domain 
mediates membrane fusion during viral entry into host cells. It 
contains several conserved regions characteristic of class I viral 
fusion proteins: fusion peptide (FP), two heptad repeats (HRs), TM 
domain and several conserved cysteine residues in the cytoplas- 
mic tail (Fig. 1). Each of these structural features has been shown 
to contribute to membrane fusion (Wentworth and Holmes, 2007). 

The primary entry receptor for NL63 was identified as human 
angiotensin-converting enzyme 2 (hACE2), which is also the 
primary receptor for group 2b SARS-CoV. This is particularly inter- 
esting since the S proteins of these viruses share only 21% amino 
acid identity (Hofmann et al., 2005; Li et al., 2003). Although they 
use the same receptor for entry, these two viruses cause differ- 


ent clinical outcomes. In contrast to NL63 that generally causes 
mild respiratory diseases, such as croup in children (van der Hoek 
et al., 2005; Wu et al., 2008; Sung et al., 2010), SARS-CoV is highly 
pathogenic and is characterized by acute lung failure with high 
mortality (Drosten et al., 2003; Ksiazek et al., 2003; Peiris et al., 
2003; WHO, 2004). 

This contrast has generated wide interest in comparing the roles 
of S proteins of these two viruses in hACE2 binding and entry. The 
receptor-binding domain (RBD) has been mapped to amino acid 
residues 318-510 for SARS-CoV (Wong et al., 2004), and 476-616 
for NL63 (Li et al., 2007; Lin et al., 2008). The RBDs of these two 
viruses bind toa largely overlapping region of hACE2 (Liet al., 2007), 
although the receptor-binding affinity of NL63 S is much less com- 
pared with SARS-CoV S (Lin et al., 2008; Mathewson et al., 2008). In 
addition, SARS-CoV S and NL63 S may cause differential downreg- 
ulation of hACE2 level by induced shedding of its ectodomain from 
the cell surface (Haga et al., 2008; Glowacka et al., 2010). Down- 
regulation of hACE2 is considered to be a leading cause of acute 
lung failure (Imai et al., 2005; Kuba et al., 2005). Finally, SARS- 
CoV S, but not NL63 S, is cleaved by endosomal cathepsin L, and 
such proteolytic cleavage is important for SARS-CoV entry (Huang 
et al., 2006). It should be mentioned that these studies were greatly 
facilitated by the utilization of lentivirus- or VSV-based pseudotype 
virus bearing coronaviral S proteins. These S-mediated pseudotype 
viruses have identical cell tropisms to the authentic coronaviruses, 
and their entry is also solely dependent on cellular receptors (Broer 
et al., 2006; Fukushi et al., 2005; Giroglou et al., 2004; Hofmann 
et al., 2004; Li et al., 2003, 2005b; Moore et al., 2004; Nie et al., 
2004; Schwegmann-Wessels et al., 2009; Simmons et al., 2004). 
The use of pseudotype viruses not only provides a safe tool to 
analyze the entry process of highly pathogenic viruses under con- 
ventional biosafety condition, but also provides a unique system 
where the role of S protein in entry is not interfered by other viral 
proteins. 

Nevertheless, compared to the tremendous efforts focused on 
SARS-CoV S, little attention has been given to the S protein of NL63. 
Here, we established an efficient pseudotype virus system for NL63 
and used it to characterize the contributions of various regions 
of NL63 S protein in mediating plasma membrane translocation, 
incorporation into pseudotype virions, receptor association, and 
pseudotype virus entry. 
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2. Materials and methods 
2.1. Virus and cells 


The NL63 virus was kindly provided by Dr. L. van der Hoek at the 
University of Amsterdam, the Netherlands, and was propagated in 
a rhesus monkey kidney cell line, LLC-MK2. Human 293T cells were 
used for protein expression, pseudotype virus production, and viral 
entry assay. Huh-7 (a human hepatoma cell line) was used as target 
cells for pseudotype virus infection. These cell lines were grown in 
Dulbecco’s modified Eagle’s medium (DMEM) supplemented with 
10% fetal bovine serum and 100IU of penicillin-streptomycin at 
37°C with 5% COp. 


2.2. Construction of expression plasmids 


In this study, the domain architecture of NL63 S protein 
was defined by using different computer programs (Fig. 1). 
The border of S1 and S2 domain was deduced from Pfam 
(http://www.sanger.ac.uk/Software/Pfam/). The signal peptide 
was predicted by web-based SOSUlIsignal (http://bp.nuap.nagoya- 
u.ac.jp/sosui/sosuisignal/sosuisignal_submit.html). The 
boundaries of FP and HR were defined as_ described 
previously (Zheng et al, 2006), whereas the bound- 
aries of the TM domain were predicted by TMpred 
(http://www.ch.embnet.org/software/TMPRED_form.html). The 
full-length wild type S gene of NL63 was obtained from NL63- 
infected LLC-MK2 cells by RT-PCR. cDNA was synthesized using 
total RNAs extracted from the infected cells using RNeasy Mini kit 
(QIAGEN), oligo (dT)1;2-1g and Superscript II reverse transcriptase 
(Invitrogen) according to the manufacturer’s instructions. The PCR 
products were amplified with Pfx DNA polymerase (Invitrogen) 
and cloned into mammalian cell expression vector pCAGGS (kindly 
provided by Dr. J. Miyazaki from the Kumamoto University Medical 
School of Japan). Based on the entry efficiency of 23 randomly 
selected S gene variants, No. 5 variant was chosen for further study. 
S gene mutants with C-terminal truncations or single amino acid 
substitution were generated using an overlapping PCR strategy. 
Two plasmids that express soluble NL63 S1/His and hACE2/Ig 
fusion protein, respectively, have been described previously (Lin 
et al., 2008). All these plasmid constructs used for this study were 
confirmed by DNA sequencing. 


2.3. Production of pseudotype virus 


HIV-based pseudotype viruses were produced as described else- 
where (Gorry et al., 2002). 

Briefly, each well of 293T cells of a 6-well plate were co- 
transfected with 5 wg of pHIV-luc, 0.2 wg of pTat, 0.2 wg of pRev and 
different amounts of NL63 S gene plasmids (or the plasmid encod- 
ing SARS S or VSVG) following the standard calcium phosphate 
transfection method. At 48h post-transfection, 1.5 ml of medium 
was collected and centrifuged at 5000rpm for 5min to remove 
large cell debris. The supernatant was further purified by passing 
through a 0.45 1.m pore size filter. The viral titer was measured 
using a reverse transcriptase (RT) assay described previously (Rho 
et al., 1981). Briefly, 300 wl of pseudotyped virus was concen- 
trated by centrifugation at 13,200rpm for 2h at 4°C. The virus 
pellet was resuspended with 10 wl of RT buffer (50 mM Tris-HCl 
(pH 7.5), 1mM DTT, 20% glycerol, 0.25M KCl and 0.25% Triton 
X-100) and mixed with 40 wl of RT cocktail consisting of 50 mM 
Tris-HCl (pH 7.5), 7.5 mM MgCl, 0.05% Triton X-100, 4.8 mM DTT, 
1 pl Poly(A)xdTj5 (Roche Diagnostics, Indianapolis, IN) and 1 pl 
TTP [methyl-7H] (PE Life and Analytical Sciences, Boston, MA). The 
mixture was incubated at 37°C for 1h and spotted on Whatman® 
2.3cm filter paper for air drying at room temperature for 15 min. 


After 3 successive 5 min washes with SSC buffer and a brief rinse 
with 95% ethanol, the filter paper was allowed to air dry on 
aluminum foil for 20min. 5 ml of scintillation solution (National 
Diagnostics) was added into a LSC vial with the filter paper for 
radioactivity measurement. The titer was measured by scintillation 
counting and expressed as the mean count per minute (cpm)/ml. 


2.4. Virus entry assay 


For 96-well plate format, 5000cpm of pseudotype virus was 
added into each well containing Huh-7 cells and incubated at 37°C 
for 2h, whereas 20,000 cpm of pseudotype viruses were added to 
each well when infection of Huh-7 or 293T cells transiently trans- 
fected with an expression plasmid encoding human ACE2 receptor 
(293T/hACE2 cells) was performed in a 24-well plate. The cells were 
washed twice with phosphate buffered saline (PBS) to remove the 
unbound viruses. 100 zl of new medium was added into each well 
followed by two days of further incubation. The cells were then 
lysed with 50 pl of lysis buffer (Promega), incubated at room tem- 
perature for 15min and centrifuged at 13,000rpm for 2 min. For 
measurement of luciferase (Luc) activity, 10 wl of cell lysate was 
mixed with 50 pl luciferase substrate (Promega). Luc activity was 
measured using a luminometer (Tuner Biosystem 20/20"). 


2.5. Protein expression, purification and preparation of anti-NL63 
S1 antibody 


For production of recombinant protein, 293T cells were trans- 
fected with expression plasmid DNA using the standard calcium 
phosphate transfection method. Soluble NL63 S1 protein was puri- 
fied as described previously (Lin et al., 2008) and used to immunize 
rabbits for preparation of anti-NL63 S polyclonal antibody. 


2.6. Western blot and immunoprecipitation assay (IP) 


NL63 S protein expressed in cells or in pseudotype virus par- 
ticles was detected by anti-NL63 S1 antibody following a method 
described previously (Lin et al., 2008 ). To detect the expression level 
of S protein in cells, 5 wg of total protein lysate was adjusted to 
a total volume of 32 wl with PBS. The cell lysate was mixed with 
8 wl 5x denaturing buffer, boiled for 15min and loaded onto an 
8% denaturing SDS-polyacrylamide gel. To detect the amount of S 
protein incorporated into pseudotype virus, 300,000 cpm of virus 
was centrifuged at 13,000 rpm at 4°C for 2 h. The pellet was resus- 
pended in 32 wl of PBS and was used for SDS-PAGE analysis. The 
electrophoresis running condition was 80V for 30min, followed 
by 120V for 150 min. As a loading control, the HIV p24 protein 
was also detected. For this purpose, 100,000 cpm of pseudotype 
virus was used for western blotting with anti-p24 antibody. The 
density of protein bands was quantified using Quantity One soft- 
ware (BioRad ). The association between NL63 S protein in cell lysate 
and soluble hACE2/Ig was measured by IP according to a previously 
described method (Lin et al., 2008). 


2.7. Flow cytometry 


To detect the S protein expression level on the cell surface, flow 
cytometry was performed as described previously (Lin et al., 2008). 
In brief, 293T cells were transfected with 1 wg/each well of the S- 
expressing plasmid DNA in a 24-well plate. After two days post- 
transfection, the cells were detached, and the two duplicated wells 
of cells were pooled together. The cells were incubated with anti- 
NL63 S1 antibody on ice for 1h, followed by three washes with 
IF buffer. They were further incubated with FITC-conjugated anti- 
rabbit IgG on ice for 30min. The cells were washed three times 
with IF buffer, fixed with 1% formaldehyde, and analyzed with a 
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Beckman Coulter FC5000 Cytofluorometer. The X-mean fluorescent 
values were used for plotting. 


2.8. Structural modeling 


Homologous domains in the RBDs of SARS-CoV and NL63 
were predicted by four proteomic analysis programs, i.e. NOMAD, 
MAFFT, T-COFFEE and Dialign. The structure model of SARS-CoV 
RBD-hACE2 complex was retrieved from protein data bank (PDB). 
Modeling was performed using Wincoot (Emsley and Cowtan, 
2004) and PyMol Viewer (http://www.pymol.org). 


3. Results 


3.1. Generation and characterization of the pseudotype viruses 
bearing NL63 S variants 


Previous studies suggested that NL63 S mediated viral entry 
activity was substantially lower than that of SARS S. However, 
the underlying mechanisms remain largely unknown. To estab- 
lish an efficient NL63 pseudotype virus entry system, we first 
needed to choose an entry-efficient S gene template. RNA viruses 
are well known to exist as quasispecies, a heterogeneous popu- 
lation that is composed of closely related sequence variants with 
minor nucleotide variation (Domingo et al., 2005). The ability to 
mediate virus entry may vary significantly among different vari- 
ants. To obtain an S variant that is highly efficient in mediating 
virus entry, we cloned the RT-PCR products of the S gene into a 
pCAGGS vector and randomly picked 23 colonies. The plasmid DNA 
from each variant was co-transfected with a non-replicative HIV 
genome harboring a luciferase gene into 293T cells. The result- 
ing pseudotype viruses were quantified and used to infect Huh-7 
cells. Luciferase activity was measured as an index of entry level or 
infectivity, and was normalized to the highest observed level. As 
illustrated in Fig. 2A, variant No. 5 consistently showed the high- 
est efficiency in virus entry. Twelve variants had an entry level less 
than 20% of No. 5 and 6 variants had <40% of entry level. Variant No. 
5 was thus selected for further study and is referred to as wild type 
S (wt S) hereafter. Sequence analysis of this variant showed that 
it contained three amino acid substitutions (A685 — V, L853 — F 
and Q998 — K) compared to the reference standard sequence in 
GenBank (accession No. NC_005831). 

We further optimized the pseudotype virus system by test- 
ing different amounts of wt S plasmid DNA. As shown in Fig. 2B, 
the amount of S DNA had little effect on the virus yield, but 
had a profound effect on the infectivity of resulting virus. The 
optimal amount of DNA for virus infectivity was 1.0 wg in each 
well of a 6-well plate. To understand the molecular basis of this 
dose-dependent effect, we measured the amount of S protein 
incorporated into pseudotype virus particles. Since the S pro- 
tein with a C-myc epitope fused either at the N- or C-terminus 
failed to produce infectious pseudotye virus (not shown data), 
we purified soluble S1 protein and prepared a rabbit polyclonal 
antibody. This antibody was used to detect NL63 S protein here- 
after. As a loading control, HIV-1 p24 protein was probed with 
a p24-specific antibody to confirm that similar amounts of pseu- 
dotype virus were used in each assay. The result revealed a 
positive correlation between the infectivity and amount of S pro- 
tein incorporated into pseudotype virus particles (Fig. 2B and 
C). 


3.2. The cytoplasmic tail of NL63 S could affect the production 
and cellular entry of the pseudotyped viruses 


The C-terminal cytoplasmic tail of coronaviral S proteins con- 
tains signals required for intracellular trafficking, assembly, and 


membrane fusion (Hogue and Machamer, 2008). A 19 amino acid 
(aa) deletion from the C-terminus of SARS-CoV S protein increased 
the pseudotype virus titer by 100-fold (Giroglou et al., 2004). To 
see if similar enhancement occurs in NL63 pseudotype virus, we 
generated four truncated S mutants, AC8, AC18, AC29 and AC38, 
corresponding to 8, 18, 29 and 38 aa deletion from the C-terminus, 
respectively (Fig. 3A). These mutants were then tested for virus pro- 
duction and cell entry. As shown in Fig. 3B, mutants AC8 and AC18 
mediated higher levels of virus entry than wt S. However, even the 
best deletion mutant, AC18, was only about 1.5 times higher than 
wt S. Further deletions were deleterious to the infectivity as the 
entry levels of AC29 and AC38 were 75% and 3% of wt S, respec- 
tively. There is a positive correlation between the viral entry levels 
and the amount of S protein incorporated into virions, with the 
exception AC29 (Fig. 3C). 

To further investigate the role of cytoplasmic tail in pseu- 
dotype virus entry, we measured the S protein expression level 
and receptor binding ability of these mutants. Interestingly, the 
protein expression level of these mutants was higher than wt 
in cell lysate (upper panel in Fig. 3D). This was especially obvi- 
ous for AC18, AC29 and AC38, implying that the C-terminal 18 
residues of S protein harbor inhibitory signal(s) for protein accu- 
mulation. This result also ruled out the possibility that the small 
amount of AC38 S protein detected in virions was due to a pro- 
tein expression defect. Since it is the S protein bound to the plasma 
membrane that is assembled into pseudotype virus, we checked 
the level of S protein on the cell surface using flow cytometry. 
The results showed that the expression level on the cell surface 
was positively correlated with the amount of S protein incorpo- 
rated into virions (lower panel in Fig. 3D). Therefore, the entry 
defect of AC38 is attributed to its defect in cell surface accumu- 
lation. This data also suggests that C-terminal 38aa region of S 
plays an important role in anchoring to the plasma membrane. We 
also did an IP assay to test the receptor binding ability of these 
mutants. Based on the relative expression level (upper panel in 
Fig. 3D), we adjusted the volume of cell lysate in order to use 
similar amounts of S protein for the binding assay. The analysis 
revealed similar receptor binding ability among these mutants in 
three repeated experiments (Fig. 3E). This is expected since only 
the S1 domain of coronaviral S protein is involved in receptor bind- 
ing while the S2 domain mediates membrane fusion during virus 
entry. 


3.3. Cellular entry mediated by NL63 S, SARS S, and VSVG 


While our data clearly demonstrated that the NL63 S variant #5- 
AC18 could significantly enhance the viral infectivity, we did not 
know its potency compared with SARS S and VSVG. It is well-known 
that VSVG-pseudotyped viruses could enter a variety of mam- 
malian cells very efficiently. Indeed, when we infected Huh-7 cells 
with same amount of these pseudotyped viruses (20,000 cpm/well 
in a 24-well plate), we consistently observed that VSVG mediated 
viral entry activity was about 100 times higher than that of SARS 
S and NL63 S (data not shown). Therefore, in order to compare the 
differences between NL63 S and SARS S and adjust the scales of 
VSVG mediated entry activity, we used 10 times less VSVG viruses 
(i.e. 2000 cpm/well) while the amount of NL63 and SARS viruses 
remained the same (20,000 cpm/well). Interestingly, we found that 
the viral entry of NL63-AC18 was 2-3 folds higher than that of 
SARS S in Huh-7 cells (Fig. 4A). However, when 293T/hACE2 cells 
were infected by these viruses, we observed that the SARS S medi- 
ated viral entry were 2-3 folds higher than that of NL63 S (Fig. 4B). 
In contrast, there was no significant difference in the entry activ- 
ity of VSVG-pseudotyped virus between Huh-7 and 293T/hACE2 
cells. 
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Fig. 2. Development of NL63 pseudotype virus system. (A) Pseudotype virus entry of 23 randomly selected NL63 S gene variants. The Luc activity was measured and 
normalized to that of variant No. 5 that consistently showed the highest virus entry level. (B) Yield and infectivity of pseudotype virus made by transfection of different 
amounts of pCAGGS vector and wt S plasmid DNA (0.04-4.0 jg in each well of 6-well plate). (C) Amount of S protein incorporated into virions. The density of protein bands 
was quantified by Quantity One, and the relative values were shown at the bottom of the gel. HIV-1 p24 protein was detected as a loading control. 


3.4. Characterization of critical residues in the NL63 RBD for viral 
entry 


Previously, we identified 15 residues (C497, Y498, V499, C500, 
K501, R518, R530, V531, G534, G537, D538, S540, E582, W585 and 
T591) ina minimal RBD (aa 476-616) that are important for hACE2 
binding by IP assay. These residues fall into three clusters that might 
represent three receptor binding motifs (RBMs) (Lin et al., 2008). 
Using the same method, Li et al. (2007) found that substitutions 
at 7 residues (G575, S576, N578, Y590, T591, V593, and G594) in 
a larger RBD (aa 301-749) affected the receptor binding ability. 
Interestingly, these residues fall into the proposed RBM3. How- 
ever, six of them were tested via a double-substitution strategy, 
i.e. they were combined in three mutants, G575/S576, G590/T591 
and V593/G594. Thus, the contribution of individual residues to 
receptor binding remained unclear. Also, all of these residues were 
only tested in the context of RBD-receptor interaction. To further 
define their roles in receptor binding and virus entry, we gener- 
ated a total of 19 single substitution mutants mostly by replacing 
these residues with alanine in AC18, a more physiologically rel- 
evant context (Fig. 1). Residues C497 and C500 were not tested 
since it was hard to uncouple their roles in receptor binding and 
maintaining proper structure by forming disulphide bonds. These 
S mutants were then tested for their S protein accumulation and 
abilities to mediate virus entry and receptor association. 


As shown in Fig. 5, most of these mutants were expressed at a 
similar level in cell lysate compared to AC18. Based on the extent 
of entry level and receptor binding ability, mutants were placed 
into four categories. First, R530A behaved almost the same as AC18 
in every aspect we examined (Fig. 5A). Second, K501A and R518A 
showed a slight reduction in entry level (92-96%) and receptor 
binding ability (60-70%) (Fig. 5A). Third, N578 — Y enhanced the 
entry level (144%) and receptor binding ability (220%) (Fig. 5B). 
Fourth, the remaining 15 mutants experienced a more pronounced 
drop of entry level (<60%) and receptor binding ability (<40%). 
Therefore, in the context of a full-length S, 15 residues within RBD 
(Y498, V499, V531, G534, G537, D538, $540, G575, S576, E582, 
W585, Y590, T591, V593 and G594) were shown to be important 
for receptor binding and virus entry. They still form three RBMs, as 
found in the context of a minimal RBD (Lin et al., 2008). In addition, 
8 mutants, W585R, G575A, V593A, G594A, V499L, D538A, E582A 
and T591A, also showed pronounced reduction of S protein in viri- 
ons (<40%). In the former four mutants, the reduced S protein in 
virions may be due to the decreased cell surface expression; how- 
ever, in the latter four mutants, their cell surface expression was 
not reduced, therefore the reduction may reflect a defect in virion 
assembly. These observations suggest that substitutions at these 
8 sites may change the structural conformation of S protein and 
adversely affect its cell surface transportation/accumulation and/or 
stability. 
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Fig. 3. Analysis of the cytoplasmic tail of S protein in protein accumulation, receptor binding and pseudotype virus entry. (A) Schematic diagram of the truncated S mutants. 
(B) Pseudotype virus entry. (C) Amount of S protein incorporated into virions. (D) S protein accumulation in cell lysate and on cell surface. (E) Receptor binding ability. The 
density of protein bands was quantified by Quantity One, and the relative values were shown at the bottom of each gel. 


4. Discussion 


In this report, we established an efficient NL63 pseudotype virus 
entry system. An S gene variant efficient in virus entry was chosen 
according to the quasispecies nature of RNA viruses (Fig. 2A). Amaz- 
ingly, 18 out of 23 randomly selected variants had <40% of the entry 
level of the best variant. This reinforces the necessity to select an 
efficient S gene template for research. The infectivity of pseudo- 
type virus was further promoted by truncation of the cytoplasmic 
18 aa tail of NL63 S protein (Fig. 3). A similar phenomenon was 
observed in the pseudotype virus systems of SARS-CoV and TGEV 
(Fukushi et al., 2005; Giroglou et al., 2004; Schwegmann-Wessels 
et al., 2009). However, compared to the 50-100-fold enhancement 
of infectivity by C-terminal deletion in SARS-CoV (18-19 aa dele- 


tion) and TGEV (14aa deletion), the maximum enhancement by 
18 aa deletion in NL63 S was only 2 folds (Fig. 3B). This difference 
presumably reflects the nature of different S proteins. Virus infec- 
tivity was positively correlated with the amount of S protein in 
virions. Similar to the truncated S proteins of SARS-CoV and TGEV, 
NL63 mutants AC8 and AC18 were more efficient than wt full- 
length Sin assembly into pseudotype virions (Fig. 3C), which should 
be due to higher expression levels on plasma membrane, the site 
of assembly and budding of lentivirus (Fig. 3D). But why did the 
removal of 18aa from the C-terminus of S protein promote cell 
surface expression and overall protein accumulation in cell lysate? 
This may be related to the intracellular retention signals contained 
within the cytoplasmic tail region. Bioinformatics analysis reveals 
that the cytoplasmic tail of NL63 S contains a putative dibasic ER- 
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Fig. 4. Viral entry mediated by NL63 S, SARS S and VSVG. Huh-7 (A) and 
293T/hACE2 cells (B) were infected with same amount of pseudotyped NL63 and 
SARS viruses (20,000 cpm/well in a 24-well plate) and less amount of VSVG viruses 
(2000 cpm/well), respectively, and luciferase activity was measured at 48h post- 
infection. 


targeting signal (KVHVQ) and a putative tyrosine-based endocytic 
signal (YYEF) at the very C-terminus (Lontok et al., 2004) (Fig. 3A). 
Such tyrosine-based endocytic signal allows membrane protein to 
be sorted from cell surface to endosomes and lysosomes for degra- 
dation (Bonifacino and Jackson, 2003). It has been demonstrated in 
infectious bronchitis virus (IBV) that the dibasic ER-targeting sig- 
nal allows the S protein to be retained near the assembly site when 
expressed alone at low levels, but a high level of expression satu- 
rates the machinery and reaches to the plasma membrane (Lontok 
et al., 2004). Also in IBV, the endocytic signal was able to rapidly 
internalize the S protein from plasma membrane (Younetal., 2005). 
SARS-CoV S protein does not have such an endocytic signal at its 
cytoplasmic tail, and is accordingly expressed on the plasma mem- 
brane. Introduction of this signal into SARS-CoV S protein caused it 
to be retained intracellularly (Schwegmann-Wessels et al., 2004). 
The roles of these two signals have not been demonstrated for NL63, 
but they most likely function in a similar fashion to those in IBV 
and SARS-CoV mutant. This can explain why the absence of an ER- 
targeting signal and part of an endocytic signal (the critical tyrosine 
is still maintained) in AC8 caused more protein accumulation on 
the plasma membrane and increased protein in cell lysate, while 
complete removal of these two signals in AC18 further facilitated 
its plasma membrane translocation and protection from endocyto- 
sis and lysosomal degradation (Fig. 3A and C). 

AC29 also showed enhanced protein accumulation in cell lysate, 
on plasma membrane and more efficient incorporation into virion. 


Furthermore, the deletion had little effect on receptor binding. 
However, the virus infectivity was reduced by 25% (Fig. 3). Com- 
pared to AC18, AC29 lacks 11 residues, NCLTSSMRGCC. This 
small region contains two cysteine residues conserved among all 
the coronaviruses. The homologous regions in SARS-CoV (LKGAC- 
SCGSCC) and MHV (FKKCGNCC) have been shown to play a role in 
cell-to-cell membrane fusion (Chang et al., 2000; Peti et al., 2005). 
Taken together, we suggest that these 11 residues contribute to 
membrane fusion during NL63 entry. 

The overall protein accumulation in cell lysate of AC38 was sim- 
ilar to AC29; however, this mutant had a defect in accumulation 
on the plasma membrane, which might directly result in the drastic 
reduction of S protein in virions and virus infectivity (Fig. 3). There- 
fore, those residues (LSTGCCGCC) that are additionally deleted in 
AC38 may participate in membrane anchoring. The exact bound- 
aries of transmembrane (TM) domains of coronaviruses have not 
yet been experimentally specified. Different predictions include or 
exclude part of the cysteine-rich domains as part of the TM domain 
(Bosch et al., 2005; Broer et al., 2006; Chang et al., 2000; Godeke 
et al., 2000; Petit et al., 2007; Thorp et al., 2006; Ye et al., 2004; 
Zheng et al., 2006). According to TMpred, the strongly preferred 
TM model of NL63 S protein predicts a TM to be located between 
residues 1299 and 1316 (VWLIISVVFVVLLSLLVF), which does not 
include any cysteine-rich domains. This prediction was adopted for 
the TM boundaries used in this paper. However, based on SOSUI, 
another web-based prediction algorithm, the TM domain is located 
between residues 1303 and 1325 (ISVVFVVLLSLLVFCCLSTGCCG), 
which includes 7 residues that were deleted in AC38. Our data 
seemingly supports the SOSUI prediction on TM domain boundary. 

We also compared the cellular entry mediated by NL63 S- 
AC18, SARS S, and VSVG. Our data demonstrated that both NL63 
S and SARS S could efficiently mediate viral entry into Huh-7 and 
293T/hACEZ2 cells (Fig. 4). Interestingly, NL63 S and SARS S exhib- 
ited significant differences in the viral entry into these host cells, 
indicating that additional host factors and/or co-receptors may be 
involved in this process. Our findings may represent an important 
advance in the development of an efficient NL63 pseudotype viral 
system for studies of the viral entry. 

A AC18-based assay system was further used to delineate the 
roles of 19 residues that were chosen based on our and others’ 
previous characterization in the context of truncated RBDs (Li et al., 
2007; Lin et al., 2008). These single residue mutants functioned 
differently (Fig. 5). Overall, their pseudotype virus infectivity was 
positively correlated with their receptor binding ability, although 
the infectivity of some mutants was interfered by their defects 
in plasma membrane translocation (e.g. W585R) or incorporation 
into virions (e.g. E582A). In terms of receptor binding, most of 
the mutants tested here functioned similarly between the context 
of a near full-length S and a small RBD. However, some residues 
that were important in the RBD (aa 476-616) were not or of 
limited importance in AC18. For instance, R530 > A abolished the 
RBD-hACE2 association (Lin et al., 2008), but the same substitution 
in AC18 had almost no effect on hACE2 binding and virus entry 
(Fig. 5). Likewise, K501 > A and R518—A reduced the receptor 
binding ability by 100% and 90% in RBD, respectively (Lin et al., 
2008), but same substitutions reduced the receptor binding by 
only 40% and 30%, respectively, and had very minor effect on virus 
entry (Fig. 5). Therefore, these three residues may be exposed on 
the surface of RBD but are partially or completely buried inside the 
S structure. This data is consistent with our previous finding that 
the minimum RBD had much higher receptor binding efficiency 
than S1 (Lin et al., 2008). 

The crystal structure information of SARS-CoV RBD-hACE2 com- 
plex and NL63-RBD-hACE2 complex has significantly advanced our 
understanding of the molecular interaction between the S proteins 
and hACE2 receptor (Li et al., 2005a; Wu et al., 2009, 2011). The 
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Fig. 5. The roles of 19 residues in protein accumulation, receptor binding and pseudotype virus entry. (A) Results for 13 residues that were previously identified in a minimum 
RBD context (aa 476-616) (Lin et al., 2008). (B) Results for 6 residues that were previously tested in double-substitution mutants in a larger RBD context (aa 301-749) (Li 


et al., 2007). 


data obtained from the NL63 RBD-hACE2 complex indicated that 
there are three receptor-binding motifs in the NL63 RBD (RBM1, 
493-513; RBM2, 531-541; and RBM3, 585-590) (Wu et al., 2009). 
These results are consistent with our previously published data 
suggesting that NL63 RBD may contain three receptor-binding 
regions (RI, RII, and RIII) (Lin et al., 2008 and Fig. 1). In addition, 
a panel of critical residues (such as Y498, G534, G537, S540, and 
W585) in the NL63 RBD that appears important for receptor bind- 
ing may have direct contact with hACE2 receptor (Lin et al., 2008; 
Wu et al., 2009). Moreover, our current study has provided func- 
tional data demonstrating that some of the residues in RBM1 (Y498, 
V499), RBM2 (G534, G537, D538, S540), and RBM3 (G575, S576, 
E582, Y590, V593, and G594) may indeed play important roles in 
the viral entry (Figs. 1 and 5; Wu et al., 2009). To further com- 
pare the molecular interactions between the SARS-RBA/hACE2 and 
NL63-RBD/hACE2, we had performed the structural modeling using 
several bioinformatics tools as described in Section 2 and identi- 
fied a short homology domain between SARS-RBD and NL63-RBD 
(Fig. 6A). Interestingly, in SARS, this domain is within the receptor- 
binding motif (RBM) that is in close contact with hACE2. Among 
the 14 receptor-binding residues revealed by the structure model 
of SARS-CoV RBD-hACE2 complex, 9 are located in this conserved 
region (underlined residues in Fig. 6A) (Li et al., 2005a). Among 
these residues, the importance of N479 and T487 has been demon- 
strated (Li et al., 2005b; Qu et al., 2005). More interestingly, this 


conserved domain in NL63 also contains 9 residues (underlined 
residues in Fig. 6A) that were experimentally shown to be critical 
for receptor binding and virus entry (Fig. 5). We therefore refer to 
the region as conserved binding motif (CBM). It has been shown 
that the RBDs of NL63 and SARS-CoV bind to a largely overlap- 
ping region in hACE2 (Li et al., 2007). Thus, it is possible that NL63 
CBM may interact with hACE2 in a conformation similar to SARS- 
CoV CBM. To support this scenario, we used the structure model of 
SARS-CoV RBD-hACE2 complex to explain the opposite activities 
of NL63 S mutants N578Y and E582A. The residue corresponding 
to N578 of NL63 in SARS-CoV is Y475 (Fig. 5A). Y475 is able to 
interact with three residues in hACE2, T27, K31 and Y83, among 
which, the interaction with Y83 is the strongest since they form 
a hydrogen bond (left panel in Fig. 6B) (Li et al., 2005a). After we 
mutated Y475 to N475, mimicking the site in NL63 (N578), it was 
no longer able to form a hydrogen bond with Y83 although it could 
still weakly interact with T27 and K31 through Van der Waals con- 
tacts (right panel in Fig. 6B). Conversely, if N578 of NL63 is replaced 
with tyrosine Y, a hydrogen bond is acquired. This model explains 
why N578Y showed enhanced receptor binding ability (Fig. 5B). 
It can also explain the drastically reduced receptor binding ability 
of E582A (Fig. 5A). The residue corresponding to E582 of NL63 in 
SARS-CoV is N479 (Fig. 6A). This residue is able to form a hydrogen 
bond with H34 in hACE2 (left panel in Fig. 6C) (Li et al., 2005a). 
However, if N479 is substituted with a glutamic acid residue, mim- 
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Fig. 6. Structure models of CBM-hACEZ2 interaction. (A) Conserved binding motif (CBM) of NL63 and SARS-CoV S protein (upper panel) and the structure model of SARS-CoV 
CBM-hACE2 interaction (lower panel). The underlined residues in SARS-CoV CBM represent those residues in contact with hACE2 revealed by the RBD-hACE2 complex 
crystal structure of RBD-hACE2 complex (Li et al., 2005a). The underlined residues in the NL63 CMB are those that were experimentally shown to be important for hACE2 
binding (Fig. 4). (B) Structural basis for the enhanced receptor binding of N578Y. The left panel illustrates that residue Y475 of SARS-CoV forms a hydrogen bond with Y83 of 
hACE2 and has two weak Van der Waals interactions with T27 and K31 of hACE2. The right panel shows the loss of hydrogen bond when Y475 is changed to N475, mimicking 
the corresponding site in NL63 (N578), while the weak interactions with T27 and K31 are still retained. (C) Structural basis for the drastically reduced receptor binding of 
E582A. The left panel shows the hydrogen bond formed between N479 of SARS-CoV and H34 of hACE2. The middle panel illustrates that when N479 is changed to E479, 
mimicking the corresponding site in NL63 (E582), the contact between virus and receptor is strengthened by formation of a salt bridge that is stronger than a single hydrogen 


bond. The right panel depicts the loss of contact when E479 is changed to A479. 


icking the site in NL63 (E582), a stronger RBD-hACE2 complex is 
formed through a salt bridge (negative—positive charge interaction) 
between E479 and H34 (middle panel in Fig. 6C). When this residue 
is changed to A, this interaction is lost (right panel in Fig. 6C). 
Taken together, we have established and optimized an efficient 
pseudotype viral system for NL63 and used it to characterize the 
role of the cytoplasmic tail of the S1 domain and further ana- 
lyzed the involvement of 19 residues in cell surface accumulation, 
receptor binding and pseudotype virus entry. This viral system 
would also be useful for investigation of the effect of NL63-S 
on hACE2 expression and shedding, which may help determine 
whether the difference in pathogenicity is related to the effect on 
the hACE2 expression/shedding. Lastly, information obtained here 
not only provides insights into the studies of molecular mechanism 
of S-receptor interaction and the function of NL63 S protein, but 
also provides important new information useful in development of 


effective vaccines, small peptides, and compounds or neutralizing 
antibodies to block NL63 infections. 
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